Rank correlation under categorical confounding
نویسنده
چکیده
Correspondence: [email protected] Department of Decision Sciences, HEC Montréal, 3000 chemin de la Côte-Sainte-Catherine, H3T 2A7 Montréal, Canada Abstract Rank correlation is invariant to bijective marginal transformations, but it is not immune to confounding. Assuming a categorical confounding variable is observed, the author proposes weighted coefficients of correlation for continuous variables developed within a larger framework based on copulas. While the weighting is clear under the assumption that the dependence is the same within each group implied by the confounder, the author extends the Minimum Averaged Mean Squared Error (MAMSE) weights to borrow strength between groups when the dependence may vary across them. Asymptotic properties of the proposed coefficients are derived and simulations are used to assess their finite sample properties.
منابع مشابه
"Quasi-REML" correlation estimates between production and health traits in the presence of selection and confounding: a simulation study.
Performance of the "quasi-REML" method for estimating correlations between a continuous trait and a categorical trait, and between two categorical traits, was studied with Monte Carlo simulations. Three continuous, correlated traits were simulated for identical populations and three scenarios with either no selection, selection for one moderately heritable trait (Trait 1, h2 = .25), and selecti...
متن کاملGeneralized Rank Tests for Progressive Censoring Procedures
In clinical trials and life testing problems the responses are time-sequential in nature and a compl~t~ experimentation may involve a prohibitive amount of time and cost •. • Fo't "this reason, progressive· censoring schemes (PCS) are often advocated with a view to terminating the experimentation at the earliest possible stage if the accumulated statistical evidence warrants to do so. In this d...
متن کاملA New Probabilistic Approach in Rank Regression with Optimal Bayesian Partitioning
In this paper, we consider the supervised learning task which consists in predicting the normalized rank of a numerical variable. We introduce a novel probabilistic approach to estimate the posterior distribution of the target rank conditionally to the predictors. We turn this learning task into a model selection problem. For that, we define a 2D partitioning family obtained by discretizing num...
متن کاملEM algorithm in Gaussian copula with missing data
Rank-based correlation is widely used to measure dependence between variables when their marginal distributions are skewed. Estimation of such correlation is challenged by both the presence ofmissing data and the need for adjusting for confounding factors. In this paper, we consider a unified framework of Gaussian copula regression that enables us to estimate either Pearson correlation or rank-...
متن کاملContingency Tables: Test For Independence
• If we have data that are measurable (eg. height, time, weight, age, test score, etc.) then we can use the coefficient of correlation (r) to measure the strength of the linear association between two sets of data. • If we have two sets of rankings, then we can use Spearman's Rank Order Correlation Coefficient to measure the strength of the association between them. • However, both of these tec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017